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Abstract — In many communication scenarios, such as in cel- 
lular systems, the energy cost is substantial and should be 
conserved, yet there is a growing need to support many real- 
time applications that require timely data delivery. To model 
such a scenario, in this paper we consider the problem of 
minimizing the expected sum energy of delivering a message 
of a given size from a source to a destination subject to a 
deadline constraint. A relay is present and can assist after it has 
decoded the message. Causal channel state information (CSI), 
in the form of present and past SNRs of all links, is available 
for determining the optimal power allocation for the source and 
relay. We obtain the optimal power allocation policy by dynamic 
programming and explore its structure. We also obtain conditions 
for which the minimum expected sum energy is bounded given 
a general channel distribution. In particular, we show that for 
Rayleigh and Rician fading channels, relaying is necessary for the 
minimum expected sum energy to be bounded. This illustrates the 
fundamental advantage of relaying from the perspective of energy 
efficient communications when only causal CSI is available. 
Numerical results are obtained which show the reduction in the 
expected sum energy under different communication scenarios. 

Index Terms — energy-efficient wireless communications, re- 
laying, deadline, mutual information accumulation, dynamic 
programming. 

I. Introduction 

A. Background 

In delay-sensitive applications, such as multimedia stream- 
ing, information is considered to be useful only if it is 
delivered within a deadline. This communication scenario is 
modeled in 0~)-|[3) as slotted transmissions with deadline, i.e., 
a fixed number of bits are to be delivered over K > 1 slots 
by a source S to a destination D. Prior to every transmission, 
causal channel state information (CSI), consisting of only the 
past and present slots (but not of future slots), is assumed to be 
known to S. The problem of minimizing the sum transmission 
energy, by allocating energy dynamically over K slots, was 
first formulated as a finite-horizon dynamic program in JTJ. 
The problem was then specialized to the case where the 
energy-bit relationship is governed by the AWGN channel 
capacity formula in |2 j. For such problems with availability of 
only causal CSI, analytical closed-form solutions are typically 
not available (TJ, 0. The optimality of some scheduling 
policies was proved in some asymptotic regimes in In 
other related works, different deadlines were considered for 
packets that arrived separately in time 0], while a continuous- 
time framework was considered in J5]. 

This work was presented in part at IEEE ICC, Ottawa, Canada, June 2012. 
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Fig. 1. A two-phase decode-and-forward protocol consisting of k = 
1, ■ ■ ■ , K slots. In Phase 1, source transmits with power p| and relay decodes. 
In Phase 2, source and relay transmit with power and p5, respectively. 



Advanced relaying techniques, especially cooperative relay- 
ing where the source and relay transmit jointly, have been 
shown to increase the achievable rate J6], reduce the error 
probability Q, or minimize energy ll8l- lfT4l . The results in 
l6l- lfT0l correspond to the single-slot system where K = 1. In 
lfTT1l - lfT4l . energy minimization was performed over multiple 
slots, allowing further gains to be achieved by optimizing over 
multiple slots. In ifTTIl . ifPHl . close-to-optimal schemes were 
obtained subject to a long-term average delay constraint, but 
this can lead to a large delay for some message bits. In |[T3l . 
the optimal scheduler was obtained with the assumption that 
previously received packets were not used for joint decod- 
ing. Recently in lfT4l . we considered the problem of energy 
minimization subject to a deadline, assuming causal CSI is 
available based on a decode-and-forward relaying scheme ||6). 
We have considered intra-slot relaying, where every slot is 
divided into Phase 1 when the relay listens and Phase 2 when 
the relay can transmit. Although the power allocation and the 
phase durations were optimized jointly over all slots, channel 
coding and decoding was performed independently for each 
slot, i.e., earlier received packets were not used for joint decod- 
ing by the receivers. If all received packets are instead jointly 
decoded by all receivers, as in lfl5l - lfT7l . then we can realize 
mutual-information accumulation (MIA). MIA increases the 
achievable rate and conversely saves transmission power for 
a given transmission rate. The effect of MIA is similar to the 
use of an ARQ scheme that employs incremental redundancy 
via retransmissions ||T8l . except that retransmissions are now 
deliberate and both R and D perform decoding. In these works 
1 15]-| 17], full CSI. consisting of the past, present and future 
channel states, is assumed to be available for rate and power 
allocation. Full CSI, however, may not be available in practice. 

B. Problem 

In this paper, we consider a slotted system where a relay 
R helps source S to deliver a given number of bits to D 
within a deadline constraint of K slots, based on the two-phase 
decode-and-forward protocol shown in Fig. Q] Different from 
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the literature, we consider: (i) only causal CSI is available for 
power (and hence also rate) allocation, (ii) inter-slot relaying, 
i.e., the transition from Phase 1 to Phase 2 occurs only at slot 
boundaries, and (iii) MIA, i.e., both R and D use all received 
packets for joint decoding. The motivation of using causal 
CSI and inter-slot relaying is that it is readily implemented in 
practice, while the motivation of using MIA is to reduce the 
transmission energy by the use of advanced coding techniques. 

We seek to minimize the expected sum energy, by choosing 
the transmission power of each node dynamically slot by slot. 

C. Contributions 

Solving the problem in closed-form is challenging, as the 
causality of the CSI implies that the present power allocation 
probabilistically affects the future slots, and the use of MIA 
creates a further dependence of the power allocation over 
time. Nevertheless, we obtain the optimal power allocation 
policy by dynamic programming, and explore the structural 
and analytical properties to provide interesting fundamental 
insights. Our specific contributions are as follows. 

• We formulate our problem using a system-level state 
diagram that depends on whether the relay is active, i.e., 
if the system is in Phase 1 or 2 as shown in Fig. Q] This 
allows us to obtain the optimal power allocation policy 
via dynamic programming. 

> We obtain conditions for which the minimum expected 
sum energy is bounded given the channel distribution. In 
particular, we show that for Rayleigh and Rician fading 
channels, relaying is necessary for the minimum expected 
sum energy to be bounded, i.e., without relaying, the 
expected sum energy is unbounded even with an optimal 
power allocation. This points to the fundamental advan- 
tage of relaying from the perspective of energy efficient 
communications when only causal CSI is available. 

• Closed-form results are obtained for specific cases to 
reveal interesting insights. In particular, our problem of 
power allocation is related to the problem of deciding at 
which slot the relay should be activated (by making the 
relay decode the message). For K = 2 slots, this problem 
is solved by minimizing a piecewise convex objective 
function. 

• Numerical results are obtained for different scenarios to 
show the potential energy savings, suggesting that most 
of the savings are recovered with K = 2. We propose 
a heuristic policy with provably bounded expected sum 
energy; for K = 2, it performs close to the optimal 
scheme for small rates. 

This paper is organized as follows. First, Section [U] gives 
the system model and formulates the problem of minimizing 
the expected sum energy. Section [HI] solves this problem 
and analyzes properties of the optimal solution. Section [IV] 
obtains general conditions for which the expected sum energy 
is bounded. Numerical results and comparisons are given in 
Section [V] Finally, Section |VI] concludes the paper. 

II. System Model 

A source S delivers a message W of nB nats over K € Z + 
slots to a destination D with the help of a relay R. Each 
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Fig. 2. Protocol for K = 3, assuming the relay decodes the message w as 
w in slot K = 1. 



slot consists of n channel uses over time. Thus, the effective 
transmission rate is i? e ff = B/K nats per channel use. 
Each node has one antenna and is subject to the half-duplex 
constraint. 

A. Coding Scheme 

The source S uses independent Gaussian codebooks for 
each slot k. For every message W <G {!,••• ,e™ S }, a 
codeword x^(W^) of length n is generated at S according to 
CJ\f(O n ,I n ), which denotes the independent and identically 
distributed (i.i.d.) complex-valued zero-mean unit-variance 
n-variate Gaussian probability density function (PDF). The 
relay independently generates its codebook, with codewords 
{x%(W)}, similarly. 

B. Relaying Protocol 

We divide the relaying protocol into two phases, as shown 
in Fig. Q] We employ a decode-and-forward relaying scheme 
where each receiver (R or D) can accumulate mutual informa- 
tion over slots. An example of the power allocation for K = 3 
slots is shown in Fig. [2] 

We assume the wireless communication scenario where the 
link SNR is constant in each slot, denoted by the positive 
scalar 7, while the channel phases may vary over n channel 
uses in each slot, denoted by the length-n vector 0. This as- 
sumption is reasonable in practice as communication systems 
are typically designed such that, over a packet or slot duration, 
the SNR is constant, while the channel phases may change 
substantially due to, for example, unavoidable offsets in the 
carrier frequencies. 

In general, for any Y-to-X link in slot k, let 7^ x > 
be the channel SNR and let &J, X be a diagonal matrix of 
the n channel phases #^ x . Every receiver X has knowledge 
of both 7^ x and 6j x , such as via appropriate training with 
preambles sent in the packet header. Since the link SNRs, but 
not their phases, change very slowly in our assumption, in 
practice the SNRs can be fed back in time by the receivers. 
Hence, we assume that every transmitter Y has knowledge of 
7^ x but not 6^ x . This knowledge is used for power allocation 
in Section III-CI 

1) Phase 1: Consider Phase 1, before the relay has decoded 
the message W. The source transmits the codeword with 
transmission power p|. The codeword is then received at 
node X € {R, D}, which denotes the relay or the destination 
respectively, as 

y* = VphF® s fe x xS + v x (i) 



3 



where v* ~ CAf(O n ,I n ) is the additive white Gaussian noise 
(AWGN). Using all past received packets {yf, i = 1, ■ ■ ■ , k} 
for joint typical decoding, the relay reliably decodes message 
W if IT9l 



"£i(ph! R )>B 



(2) 



where I(x) = log(l + 7) is the mutual information function 
for point-to-point Gaussian channels with received SNR 7 > 
and log is the natural logarithm. We note that /(•) is a strictly 
increasing concave function. Thus, the mutual information 
accumulates over slots until it reaches or exceeds B, upon 
which the relay reliably decodes W. 

2) Phase 2: We denote K as the earliest possible slot 
index such that (O holds, where 1 < K < K — 1; if 
K > K, relaying is not possible and so Phase 2 is not 
activated. Consider Phase 2 that runs from slot k = K + 1 to 
k = K, in which the relay can perform relaying. Both S and R 
transmit concurrently using their independent codebooks. The 
destination thus receives 



R„,RD qRD 



Pklk 



(3) 



for K + 1 < k < K. Since typically the channel phases can 
change significantly, we assume the extreme case of a phase 
fading channel J6j, i.e., each element in SD and RD is i.i.d. 
with a uniform PDF over [0, 2ir). Using all received packets 
{y^, k = 1, • • • , K} for joint typical decoding, the destination 
reliably decodes message W if J6) 



A 

E 

fe=i 



S„,SDn 



A 

E 

k=K+l 



S„,SD 



■ P hn>B. 



(4) 



Remark 1: To obtain |@), we use independent codebooks at 
S and R, and assume a phase fading channel. Alternatively, 
suppose we use an identical (Gaussian) codebook at both 
nodes followed by a distributed space-time block cod^H 0. 
Then, regardless of the PDF of the phases, the SNRs combine 
as Pfe7fe D +-PfeTfc D m tne secon d phase and so we obtain (0]i too. 
Hence our subsequent result applies equally for both cases. 



C. Problem Statement 

We consider a general problem formulation where for slot 
k £ JC = {1,2, • • • ,K}, some CSI is available in some 
CSI space Sk- Let = (p|,p R ) be the power allocation 
and 7 fe = (7^, 7^°, 7 RD ) be the link SNRs in slot k. We 
denote a power allocation policy it £ IT as a mapping of the 
CSI Sfc £ Sk to the non-negative power allocation for all 
k £ JC, where II is the space of all feasible policies, i.e., 



7T = {Pfc(s fe ) > 0,Vs fc £ S kl k £ K,}. 



(5) 



Our problem is to find the optimal power allocation policy 
7r* so as to minimize the expected sum energy, subject to 

1 In conventional distributed space-time block codes, the power is fixed, but 
we shall optimize the power given causal CSI. We shall subsequently show 
that it is optimal that only one node transmits and so there is, in fact, no need 
to use a distributed space-time block code. 



delivering B nats per channel use in K slots based on the 
described relaying protocol. We assume an initial CSI Si is 
(arbitrarily) given. Mathematically, the optimization problem 
is given by Problem P0: 



min E 

•n-en 



A" 



J2p s k+p R k 



LAi=l 



Si 



(6a) 



s.t. K = argmin j 1 < k < K : I(phf R ) > 5 j (6b) 



A' 



A 



E^ D )+ E i(pht D +Phn>B (60 

k=K+l 

where E(-) is the expectation over all random variables (condi- 
tioned on the initial CSI Si). The inequalities in (l6bl and (l6cl > 
are due to <J2J and ©, to ensure that the message is decoded 
by the relay and the destination at the end of K and K slots, 
respectively. 

In the next section, we shall solve Problem P0 for the 
specific case where causal CSI is available. 

III. Optimal Power Allocation Policy 

We first formulate Problem P0 equivalently as Problem PI 
given causal CSI is available in Section UlI-AI Then we define 
a system-level state diagram in Section IIII-BI which solves 
Problem PI by dynamic programming in Section IIII-CI We 
provide further analysis in Section Ull-Dl for K = 2 slots. 

For convenience, we denote a collection of scalars or vectors 
with running subscripts as x\ = (xk, Xk+i, ■ • • where 
k = max{l, i} if i < j and j > 1; otherwise, we let x\ = 0. 

A. Equivalent Problem 

In this paper, we assume that at the start of each slot k £ tC, 
causal CSI is available at all nodes, in terms of the present 
and past link SNRs 7^ = (71,72, • • • ,7*0- 

Before we specialize to the causal CSI case, we make the 
following simplification in Lemma Q] 

Lemma 1: Suppose that the CSI is available in slot k £ 
JC. If the present link SNRs 7^ is a deterministic function of 
Sfc for k £ tC, then there is no loss in optimality in solving 
Problem P0 if only the stronger node, i.e., either S or R with 
the larger (or the same) SNR to D, transmits in each slot in 
Phase 2. 

Proof: See Appendix lAl ■ 
The causal CSI explicitly includes the SNR 7^. Thus, 
Lemma 1 trivially applies. This observation us to reduce the 
number of power allocation variables, as follows. Let B R be 
the additional amount of mutual information required by R to 
reliably decode the message in slot k. 

• If P R < (i.e., slot k is in Phase 2), only the stronger 
node transmits. We denote the transmission power as 
Pk > 0, with the corresponding link SNR given by 

7^max( 7 f D ,7, RD ). 

• If P R > (i.e., slot k is in Phase 1), we re-write p\ as 



p k with the SNR still as 7^ 



SD 
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Henceforth, there is only one scalar power pk to be optimized 
for every slot, and the policy (O simplifies as 

n = { Pk {s k )>oy Sk eS k ,keJC}. (7) 

Thus Problem PO is equivalent to Problem PI, given by 

K 



Phase 1 



Phase 2 



Phase T 



min E 



Lfc=i 



(8a) 



K { \ 

Y^nvu(iT^{Bi > o)+7a(5£ <o))) >b m 

k=l ^ ' 

where l(-) is an indicator function that takes the value of 1 
if the condition in the argument is true and otherwise. Note 
that via power allocation, we also implicitly determine K as 
defined in ( l6cl >. 



s.t. 



B. State Diagram 

Similar to Bl let be the additional amount of mutual 
information required by D to reliably decode the message in 
slot k. As k increases, both PP and B® decrease or remain 
the same as they receive more mutual information. We assume 
Bf — Bi = KR e ff, meaning that no mutual information is 
available for both nodes initially; it is easy to extend to the 
general case where each has some initial side information. 
From d2J and ©, we have 



B° k = 



B k —i 
B D k . 



I(pk-ilt H -i) 



Bl 
Bl 

Bl 
Bl 



> (Phase 1); 

< (Phase 2) 

> (Phase 1); 

< (Phase 2) 



(9a) 



(9b) 



i - I(Pk- 
i - I(Pk-Uk- 

for k > 2. We note that d9al l reflects that the relay does not 
accumulate mutual information in Phase 2 as it has decoded 
the message, while ( |9~bl reflects that if the relay can start to 
perform relaying in Phase 2, then the equivalent SNR is given 
by 7 fc _i instead of 7^. 

Besides 7^, let b/. = (5^,5°) also be available as causal 
CSI; there is no loss of generality, since can be calculated 
from the past power allocation based on the given policy ir. 
Thus, the causal CSI for slot k is 



s fc = (7i',b fc ) e S k 



(10) 



where S k is the corresponding causal CSI space such that 7* 
and bfe are non-negative. This definition of the (extended) CSI 
is useful for us to define the state diagram next. 

We introduce the termination phase T to indicate that B D < 
0, i.e., D has decoded the message. Consider a slot k G K, in 
Phase 4>k € {l7 2,T}. We refer to (k,<fik) as the system state 
and the corresponding causal CSI Sfc as its state valu^l The 
transitions of the system states occur as follows, see Fig. [3] 

• Suppose slot k = 1. The slot must be in Phase 1, hence 
the system states (1,2) and (1,T) are empty. 

• Suppose B® +1 < 0. The system state (k,<f>k) transits to 
(k + 1, T) and the transmission ends. 

2 The state value should be denoted as Sfc fc to reflect its full dependence. 
We maintain the notation for notational convenience, and also because the 
state value contains and B D which completely determine the phase 



slot 1 



slot 2 2,1 



slot 3 3,1 



slot K 




Fig. 3. All possible transitions of system state (fc, </>j.) for slot k in Phase <j>k- 
A transition to (fe + 1, T) that results in termination occurs if < 0. 
Otherwise, a transition to (fc + 1, 2) at Phase 2 occurs at the first occurrence 
of BP < 0. 



• Suppose B®, x > and the slot k > 1 is in Phase 1. 
The system state (fc, 1) transits to (k + 1, 2) if B% +1 < 0, 
as the relay has reliably decoded the message after slot 
k; otherwise the system state transits to (k + 1, 1) and 
remains in Phase 1. 

• Suppose B® +:1 > and the slot k > 1 is in Phase 2. The 
system state (k, 2) always transits to (k+ 1, 2) as it stays 
in Phase 2. 

Remark 2: The state diagram, and hence the subsequent 
results, apply also to the case where no relaying is performed. 
This is done by decreasing the link SNR 7^ from the source 
to relay to zero, in which case Phase 2 will never occur. 

C. Dynamic Programming 

The link SNRs 7^ are treated in general as random vari- 
ables with PDF J^k(-). We treat p^. and also as random 
variables, since they depend implicitly on the link SNRs via 
the policy n. 

Theorem Q] solves Problem PI. We use the Bellman's 
principle of optimality 11201 . in which the optimal policy 
can be obtained by recursively solving a set of so-called 
Bellman's equation (viz. ( fTTT )) with the same problem structure 
(viz. a minimization in (II lbb ). It is convenient and intuitive 
that the indexing of Bellman's equation in (fTTT) is two- 
dimensional over (k,<pk), which follows naturally from the 
two-dimensional state diagram in Fig. |3] In contrast, if there 
is no relaying, the indexing is one-dimensional |2), Q. 

We denote the inverse of the mutual information function 
as I^ 1 {x) = exp(x) — l,x > 0. 

Theorem 1: Given causal CSI Si at slot 1, the minimum 
expected sum energy in Problem PI is given by Ji i(si), 
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which is computed recursively for decreasing k 
1, ■ ■ ■ ,1, as follows: 



K, K 



Jfc,0 fe (s fe ) 

Jk+ij(s k ) 



mm p fc 

Pfc>0:B° +1 <0 



6 a- = 1; 
h< = 2, 



(11a) 



min pjfe+E^ B [J fc+li ^ (s fc+ i)|^fc,s fc ], 

Pfc>0 



0. 



for fc e K\K,<l> k e {1,2}, (lib) 
forfce/C. (11c) 



An optimal policy tt* that achieves Ji.i(si) is given by the 
power allocation p k (s k ) that solves dl la| > and d 1 lbb for all 
possibl^l system states (k,4> k ) and state values Sfe. 

Proof: See Appendix iBl ■ 

From Theorem Q] the minimum expected sum energy is 
given by Ji,i(si) that is computed by the Bellman's equation 
(fTTb . while the power allocation that solves the Bellman's 
equation form the optimal policy tt*. In dl lbl ). the first term 
p k is the energy used by the current slot, while the second 
term is the expected energy accumulated by all future slots 
k + 1, • • • ,K. Thus, besides minimizing the energy used by 
the current slot, the optimal policy also accounts for the energy 
used in the future. 

The complexity of solving the Bellman's equation via 
dynamic programming is P-complete, i.e., as hard as any 
problem with polynomial-time complexity, and so the solution 
likely cannot be obtained by highly parallel algorithms lETl . 
Hence, the (offline) computational complexity to obtain the 
optimal power allocation policy tt* is fairly high. However, 
the (online) implementation complexity can be made low by 
first storing tt* in a lookup table, which contains the mapping 
of all possible state values to the optimal power allocation. 
During online operation, the table is then used to allocate the 
power for every slot given the system state and state value. In 
practice, the state values are first quantized before the optimal 
policy is obtained and stored (as is done to obtain numerical 
results in Section[V]i. The exact granularity of the quantization 
depends on the tradeoff between the complexity /storage of the 
policy and the expected sum energy. 

Next, Theorem[2]reveals exactly how tt* leads to the optimal 
transition from Phase 1 to Phase 2. For full generality, we 
assume an mth-order Markovian channel where the PDF of 
the link SNRs factorize as 



K 



(12) 



k=l 



If m = K, this corresponds to the most general case where 
the PDF of the link SNRs is arbitrary and thus need not 
be Markovian. If m = 1, this corresponds to the first- 
order Markovian channel, which is widely considered in the 
literature to reflect the dependence of the present link SNRs 

3 To reduce computations, we need only consider a state value sj. if it is 
relevant to its system state [k, <j> k ). F° r example, if = 2, i.e., the system 
is in Phase 2, then (related only to the relay) is no longer relevant to 
future states and can be ignored. 



only with respect to the most recent past link SNRs, see e.g., 
11221 . 11231 . If m = 0, this corresponds to the case of i.i.d. link 
SNRs. 

Theorem 2: Assume an rath-order Markovian channel 
where the PDF of the link SNRs is given by O- To 
achieve the minimum expected sum energy in Problem PI, 
the Bellman equations dl lab , dl lcb hold while (1 1 lbb can be 
solved equivalently as 



Jk,i(sk) = 

^fc,l->-l(Sfe) = 
</fc,2(Sfc) = 



min{J fci i_^ 2 (sfc), Jfe,i->i(Sfc)} 
I-HRk) , 



mm 

0<R k <min{R^ ,B°} 



(13a) 

J'(Rk) (13b) 



mm 

min{K%,B°}<R k <B° % 

min ^y*) +j'"(R k ) 

0<R k <B° 7fc 



7f° 

J ~^p fe) +J"(R k ) (13c) 



(13d) 



where for a given Sfe, we denote 
J'(R k ) = E 7fc+1 [J fe+1 , 1 ( 7fc+1 ,b fc - (g{R k ),R k ))\ 7 %_ 



m+U 



J"(R k ) = E 7fc+1 [J fe+1 , 2 ( 7fc+1 ,b fc - (g(Rk),R k ))hk-m+i] 
J"\R k ) = E 7fc+1 [J fc+1 , 2 ( 7fc+1 ,b fc - (0,Rk)M- m +i} 



g(x) = I(l- 1 (xH R / 1 5 k °),x>0 
Rt=l(l-\B*H D /jl 



SR\ 



Proof: First, we prove (113 dl t . Suppose the system state is 
(k, 2). From Fig. [5] the system state can transit to (k + 1, 2) 
or (k + 1, T). We make a one-to-one transformation from the 
power variable p k to the rate variable R k = I{p k ^ k ), which 
represents the additional mutual information received by D in 
slot k. The second term in (11 lbb expands probabilistically to 
two cases: the system state transits to (k + 1, 2) if < R k < 
B% and to (k + 1, T) if R k > B k . To optimally allocate R k 
(or p k ), we minimizes over both cases. Thus dl lbb becomes 
JkA s k) = mm{Q',Q"} where 

I-HRk) 



Q' = min 

0<R k <B 



Q" 



mm 

Rk>B[ 



Ik 

i-HRk) 

Ik 



+ E S(j+1 [J fc +i, 2 (7fc+i,b fc+ i)|s fe ] 
E Sfc+1 [Jk+ijhk+i,h k+1 )\s k ] . 



From dl let , J k +ij{- : ■) = 0, thus it is optimal to let R k = B® 
in the minimization problem to obtain Q". To obtain Q', we 
simplify b^+i = b^ — (Q,R k ) by using (O for Phase 2. 
The only random variable remaining in the expectation in 
the minimization problem to obtain Q' is 7^+1, which only 
depends on ~f k _ m+1 due to the Markovian property in (Q~2). 
By the continuity of both cases at R k = B k , we obtain d!3dl ). 

We sketch the proof for (113aK which is more tedious but 
follows similarly as before. Suppose the system state is (k, 1). 
From Fig. [3] the system state can transit to (fc+1, 1), (fc + 1, 2) 
or (k + 1,T). We make a one-to-one transformation from p k 

to R k = i( P kil D ). if R < Rf = 1 (i-HBkU D hk R ) and 

R < B k , the system state transits to (k + 1, 1); otherwise 
the system state transits to (k + 1, 2) or (k + 1, T). Following 
similar steps, we get (|13ab , d!3bb and d!3cb . ■ 
Intuitively, R k represents the mutual information made 
available (via power allocation) to the destination in slot k; 
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R]y represents the mutual information threshold for Rk beyond 
which the system transits from Phase 1 to Phase 2. In (113a| >, 
Jk.i^i( s ) and >/fc.n-2(s) are the optimal sum energy from 
slot k to slot K, assuming the present slot is in Phase 1, and 
the next slot is in Phase 1 and Phase 2, respectively. Thus 
the minimization in (II 3at is interpreted as a decision making 
between remaining in Phase 1 or transiting to Phase 2. On 
the other hand, in ( 113dt the slot is already in Phase 2, thus 
such a choice need not be made. Overall, we can interpret 
the dynamic programming as determining the optimal mutual 
information or power to allocate, and hence also the optimal 
slot K where the transition occurs. 

Remark 3: To solve Problem PI, it suffices to use Sfc = 
(lk, Jk-m+i' kfc) € 5fe as a more compact causal CSI, instead 
of the original causal CSI ( fTOb . This is because to compute 
the Bellman equation in Theorem [2] only the compact causal 
CSI, instead of ( fTOl i. is required for all k = K, - ■ ■ ,1. Hence, 
the smaller the Markovian order m, the smaller the size 
of the causal CSI state space Sk- If m is small, both the 
computational complexity and implementation complexity of 
the optimal policy can thus be reduced significantly. 

D. Closed-form Solution for K — 2 Slots 

To gain further insight, we consider K = 2 slots which 
gives a closed-form solution. 

Let [x] h a equals a,x or b, respectively, if a; < a, a < 
x < b or x > b, and let [x] + = max(0, a;). Also, let 
$i 4 E 7 sd [l/ 7 | D |7i D ] and $ 2 4 E^ 2 [l/ 72 | 7 i]. Clearly, 

Corollary 1: Consider Problem PO with K = 2 and initial 
CSI s = ( 7 ,b) where 7 = ( 7l SR , 7 f D , 7l RD ), b = (B D ,B D ). 
Then the minimum expected sum energy is given by 

Ji i (s) = min < min Oi(iJ), min q 2 (R)) > (14) 

[0<R<B' B'<R<B° J 

= min{g 1 (R* 1 ),g 2 (R* 2 )}. (15) 

Here, g t (R) = ^$-+$ i I- 1 (B D -R), i = 1,2, corresponds 
to the objective functions in (II 3bb and d 1 3cb . respectively; 
Rl = [Rl}+ and R* 2 = [jR|]|° denote the correspond- 
ing optimal solutions, where B' = min{i? th , B D }, R th = 
I (J"- 1 (S D )'yf D / 7 ^ R ) and R° = (log( 7l SD $ 4 ) + B D )/2, i = 
1,2. 

Proof: Applying ( II lab , (II lcb and ( fL3l for K = 2, we 

get Ji,i(s) = min{ Ji,i_v 2 (s), Ji,i-n(s)} where J M ^i(s) = 
min <_R<B' gi{R), Ji.i^2(s) = min B /<^< B D 52 which 
proves (fl4l . Next, we prove (fTBI l. For convenience, a typical 
graph is shown in Fig. [4] 

Suppose jf D > 7l SR , thus R tb > B D and B' = B D . Then 
trivially Ji.i->. 2 (s) = g2(B D ). Now we consider J^i^^s). 
Since is convex, gi(-) is a convex function. The 

optimal rate R^ for the unconstrained minimization of gi(R) 
is obtained by solving the equation formed by setting the 
first derivative of gi(R) to zero, which is given by i?J = 
(log( 7 ^ D $i) + B D )/2. Since a feasible rate must satisfy 
< R < B D , and because gi(R) is convex, it follows that the 
optimal solution that minimizes gi(R) is given by R\ = [R1} + 
or B' — e, where e > is infinitesimally small. Without loss 



of generality, however, we do not need to consider B' — e 
as a possible candidate, since it cannot be optimal due to 
92(B') < gi(B'). It can be easily checked that Ji.i(s) is 
thus given by ( fTBI l for the case of 7 ^ D > 7 ^ R . Next, suppose 
7 i D < 7 i SF \ thus i? th < B D and B' = R tb . Since gi (R) and 
g2(R) are convex functions, it can be shown similarly that the 
optimal solutions for Ji ) i_ > i(s) and Ji.i->2(s) are given by 
and [-R^ls' > respectively. Thus, Ji,i(s) is also given by 
(fT5l l for the case of 7 ^ D < 7 ^ R . This completes the proof. ■ 
We give an intuitive explanation of Corollary [TJ Fig. |4] 
plots a typical graph of the objective functions g\ and g 2 and 
indicates their respective local optimal solutions R\ and R\, 
assuming 7 j D < 7 j R and so B 1 = R th . In general, we can 
express ( fT~4T > as 

Ji,i(s)= min g{R) (16) 

0<R<B° 

where g(R) 4 gi {R)t(R < B') + g 2 (R)t(R > B'). 
The objective function g(R) is piecewise convex, as both 
gi(-) and g 2 {-) are convex functions. To obtain the global 
optimal solution, it is sufficient to check for the boundary 
and local optimal solutions separately in each region where 
the convexity of the objective function holds, and then choose 
the global solution as the one that minimizes g(R). This check 
gives ( fTBI l. For example, given the rate domain < R < B', 
the local optimal solution is i?J and the boundary solutions are 
and B' . As the objective function g\ is convex, the optimal 
solution R\ in this domain must then be + . 

In general for any k, the minimization in Phase 1 is 
performed over a piecewise non-linear function, so as to decide 
whether the system state (k, 1) should transit to other states. 
For K = 2 slots, this decision can only be made in slot 1, 
and moreover the objective function is a piecewise convex 
function, thus allowing a closed-form solution to be obtained. 
For K > 3 slots, however, the decision can be made at any 
slot k = 1, ■ • • , K — 1 and the objective function may be a 
piecewise non-convex function, thus a closed-form solution is 
difficult to obtain. Therefore for K > 3, we resort to numerical 
methods in Section |V] based on Theorem [2] We note that the 
problem with no relay is already hard to solve, see @, fl3j . 

Despite the difficulty in obtaining closed-form solutions in 
general, we answer the important question of whether the 
optimal solution leads to bounded expected sum energy in 
the next section for any K. 

IV. Conditions for Bounded Expected Sum Energy 

In this section, we consider conditions for the minimum 
expected sum energy to be bounded for a general channel 
distribution. In particular, we show that the expected energy 
is unbounded in Rayleigh and Rician fading channels if 
relaying is not used. For simplicity, we assume the link SNRs 
7fc R '7fc D '7fc D are mutually independent and also independent 
over slot k, but follow the same class of channel distributions. 

For exposure, we first assume the link SNRs follows the 
truncated exponential PDF. Then we consider the general case. 
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H, 2 H x 



Fig. 4. Typical graph of objective function g(R). Here, g(-R) = 
g 1 (R)l(R < R th ) + g 2 (R)t(R > R th ) with the mutual information 
threshold R th , while R° represents the local optimal R for the objective 
function gi(R). 



A. Truncated Exponential Distribution 

Consider the truncated exponential PDF given by 



/ 7 (75 7) = cexp(-7/ 7 ) 



(17) 



for 7 > 7t runc and / 7 ( 7 ; 7 ) = otherwise. The param- 
eter 7 tmnc > is a SNR truncation threshold, and c = 
cxp(7 tmnc /7)/7 is a normalization constant. As 7t rur ,c —> 0, 
the truncated exponential PDF approaches the exponential 
PDF with average SNR 7, i.e., the channel amplitude follows 
a Rayleigh PDF. For convenience, we shall refer to 7 as the 
average SNR in general, even though it equals the expectation 
of 7 only if 7trunc 0. 

We assume all link SNRs follow the truncated exponential 
PDF with the same SNR truncation threshold 7 trU nc but with 
possibly different corresponding average SNR's 7 SR ,7 SD , 7 RD 
These distributions are considered in [2] where no relay is 
present, with the restriction that 7 trU nc is strictly positive. 
However, in the literature it is typical to assume a Rayleigh- 
fading channel for wireless communications, which corre- 
sponds to 7trunc = 0. As such, we also focus on the case 
where 7 t rU nc -> 0. 

For comparing the performance for different K, we define 
the normalized minimum expected sum energy (NMESE) 
Jjc (i? e ff ) as the expected sum energy per slot for transmitting 
i? e ff nats per slot. That is, 



Jk(R*b) = [Ji,i(si\K slots)] 



(18) 



where Ji^(si\K slots) emphasizes that the minimum ex- 
pected sum energy Ji,i(si) is for a K— slot system, and 
the initial system state is S\ = (71, bi) where hi = 
(KR e ff, KR e ff). For the case of no relaying, we denote its 
NMESE as J™ relay (i? e ff). Since relaying requires K > 2, 
clearly Ji(R e «) = relay (i? eff ). 

For K = 1,2, we obtain the following respective closed- 



form solutions (see Appendix let 

'1-HR*) 



Jl(i?eff) =1 



7 SD 



= $ 1 /- 1 (i? eff ) 



7! SD 



$ 1 /- 1 (2 J R eff - 
$ 2 /- 1 (2i? eff 



Rl) 



(19) 



(20) 



where $1, <E> 2 , -Ri and R% are defined in Section HIl-DI Here 
and henceforth, we assume the expectation is taken over all 
random variables and drop the subscript of the expectation op- 
erator. We simplify $1, $2 for our specific channel distribution 
to give (see Appendix [Cb 



$ 1 =E[l/ 7l SD ]= C SD E 1 ( 7tm nc/7 SD 



= E [l/ 7l ] = c bu Ei . 



7" 

7trunc 



„RDt^ / 7trunc 



7 



(21) 



(22) 



exp(— t)/tdt is the exponential in- 
tegral, 7 = HM(7^ D , 7 RD ) is the harmonic mean of 



Here, Ei (x) 

7 = 



SD 



7^ D and 7 RD , and k = cxp (7 tr unc/7) /l- Also, c 
exp(7 tmnc /7 SD )/7 SD ,c RD = exp( 7trunc /7 RD )/7 RD are the c's 
in ( fTTI ) corresponding to the respective links. 

Consider 7t ru nc — > 0, i.e., all links approach Rayleigh-fading 
channels. Theorem [3] states that the minimum expected sum 
energy (or the NMESE) is unbounded if there is no relaying 
for any K > 1, which is an extremely pessimistic result, 
but becomes bounded if relaying is used. Note that relaying 
requires K > 2. 

Theorem 3: Suppose that the SNRs for different links and 
different slots are independent and each link follows the 
truncated exponential PDF. Then the following holds for the 
NMESE: 

(i) Without relaying, J™ lelay (i? eff ) — > 00 as 7t runc - > for 
K> 1. 

(ii) With relaying, Jfs-(i? e ff) is bounded as 7 tmn c — > for 
K > 2. 

Proof: We prove the two parts separately. We will need 
Lemma H] in Appendix ID1 for part (i). 

(i) For K = 1, the NMESE with and without relaying is 
the same. From Q9), relay (s) = Ji(i? e ff) = $i/ -1 (i2eff), 
which approaches infinity because from Lemma |2] $1 — > 00 
as 7trunc — > 0. Subsequently, we assume K > 2. From Re- 
mark [2] the state diagram for the case without relaying can be 
obtained from Fig. [3] Suppose that the NMESE T° lelay (i? e ff) 
is bounded as 7t rur , c — > 0. Then in the optimal policy, we 
claim that the transition from system state (K, 1) to (K + l, T) 
occurs with zero probability. Otherwise, if the transition occurs 
with strictly positive probability, the contribution of the energy 
to transit to the system state (K + 1,T) is I^ 1 (B^ c )/^l D . 
Since the SNRs are independent over slots, the expected 
energy incurred is <I>iE[/ _1 (i3^-)], and it goes to infinity as 
7trunc ^ > 0. Thus, the transition to system state (K + 1,T) 
cannot occur. This implies that the transition from (K — 1, 1) 
to (K, 1) cannot occur as well (see Fig. [3] assuming there is 
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no system state in Phase 2). The state diagram for this A'-slot 
system thus reduces to the (A — l)-slot system. By repeating 
the same arguments that the state transition from (k, 1) to 
(k + 1, T) cannot occur for decreasing k = K — 1, • • ■ , 1, 
we are left with the state diagram of a one-slot system. But 
we have shown that J^ orelay (s) — > oo as 7 trU nc — > 0. By 



contradiction, relay (i? e ff ) is unbounded as 7 tm nc 

(ii) It is sufficient to show that a power allocation policy 
that achieves a bounded expected sum energy exists. First, 
consider K = 2. We use the following heuristic power 
allocation policy. S transmits in the first slot with power 

Pi 



0. 



1 ■ If 7i° > 7i R , *en D can decode the 



max{7^ D ,7j iR } ' 



message and so the transmission terminates. If 7^ < 7^ , 
then R can decode the message while D accumulates mutual 
information of amount R th = I (/ _1 (7? eff )7p/7^ R ). In the 
second slot, the stronger node of S or R transmits with power 
such that D reliably decodes the message. 



P2 = 



max{ 7 ^ D , 7 R D } 



The expected sum energy of this policy is then given by 



< E 



+ E[p 2 | 7 i S 
i- 1 {Res) 



SD 



<i! R ] 



max{7^ D ,7^ R } + max{7| D ,7 RD } 



7?° < 7! SR 



= /- 1 (i?aff) 



2/- 1 (i? eff )$ 2 . 



max{7p,7 RD } 



E 



.max{ 7 | D ,7 RD } 



(23) 



Here, the inequality is because i? th > and is an in- 

creasing function, the first equality is due to the independence 
of the SNR, and the second equality is because the average 
SNRs are the same over slots. By Lemma [2] in Appendix ITJ1 
$2 is bounded as 7t rur , c — > 0. As the optimal policy achieves 
at most the same energy as (1231 . Ji,i(s) and its expectation 
•hiRefi) are also bounded. 

Next, consider K > 3. For the first (A"— 2) slots, we can use 
any power allocation policy that gives a bounded expected sum 
energy, e.g., S transmits at some fixed power. For the last two 
slots, we employ the above heuristic power allocation policy 
for A = 2 to deliver the remaining mutual information to D. 
This policy, and hence the optimal policy, achieves a bounded 
expected sum energy for A > 3. ■ 

Theorem [3] gives the fundamental advantage of using re- 
laying compared to no relaying, from the perspective of 
minimizing energy. If no relaying is used, we must therefore 
accept some probability that the destination cannot be served 
in Rayleigh-fading channels in practice. If relaying is used, 
however, the destination can always be served. 

Consider the case of no relaying. Intuitively, if A = 1, 
the expected sum energy is expected to be unbounded due 
to the lack of channel diversity. If K > 2, Theorem [3] states 
that the NMESE is unbounded, even with an optimal policy. 
This may be surprising because under the assumption that full 
CSI is available, the NMESE is in fact boundecQ. Hence, it 

4 We give a sketch of the proof. We use the suboptimal power allocation 
policy that allocates all the energy to the slot with the largest SNR to deliver all 
bits to the destination. It can be shown, say by the use of ordered statistics, 
that the expected energy is bounded. Hence, the optimal policy must also 
achieve a bounded expected energy. 



is not obvious if the NMESE is bounded given only causal 
CSI. Given only casual CSI, the channels in the future are 
not known in advance, and based on Theorem [3] it turns out 
that it is difficult to effectively exploit the time diversity of the 
channels. In fact, given simply a one-slot-look-ahead CSI, i.e., 
assuming the CSI of the present and the next slot is known, is 
sufficient for the NMESE to be boundec@. This reinforces the 
fact that causality plays an important role in the boundedness 
of the NMESE. 

On the other hand, relaying can exploit spatial diversity over 
different nodes to give a bounded expected energy. This is 
because the SNRs of different links are available in the present 
slot, which can be used effectively. For example, consider 
the heuristic power allocation policy proposed in the proof 
of Theorem [3] which gives provably bounded expected sum 
energy. This power allocation policy exploits spatial diversity 
in both slots to give Pl = J x{j f^l } and p 2 = ^|^gj . 
In contrast, consider the following naive power allocation 
scheme with relaying over K = 2 slots: S transmits such that 
R decodes the message in the first slot, then R transmits in the 
second slot. Although the advantage of cooperative relaying 
(via spatial diversity) can be exploited in the second slot, the 
expected energy in the first slot is unbounded as 7 trunc — > 
(corresponding to a one-slot system). Thus, exploiting the 
spatial diversity over only one slot appears insufficient in a 
relaying system. 

So far, we restrict the study on the boundedness of the 
NMESE for the Rayleigh distribution, by using the truncated 
exponential distribution and letting 7t rur ,c — > 0. In the next 
section, we generalize the proof technique used in this section 
to arbitrary channel distributions. 

B. Arbitrary Channel Distributions 

We consider an arbitrary channel distribution with the same 
assumption that channels over different links and slots are 
independent. Theorem |4] gives conditions to determine if the 
NMESE is bounded. Without loss in generality, we assume 
that all average SNRs are the same, i.e., the channels are i.i.d. 
over links and slots. 

Theorem 4: Suppose that the SNRs for different links 
and slots are i.i.d. with cumulative density function (CDF) 
^(7), 7 > 0. Then the following holds for the NMESE: 

(i) Without relaying: relay (A e ff) is bounded if and only if 
$1 = E[l/7] is bounded. 

(ii) With relaying: Jk (Res) is bounded if $2 = 
E [l/max(7 SD , 7 RD )] is bounded. 

Proof: (i) "Only if part: Following part (i) of the proof 
of Theorem[3] it can be easily verified that if $1 is unbounded 
in general for any CDF, then J™ relay (A e ff) is unbounded. 
"If part: We allocate power such that D reliably decodes 
the message in the first slot. This incurs an expected energy 
of $i/ _1 (i?eff), which is bounded if $1 is bounded. Thus 

jno relay ^fl^j j s a J s0 bounded. 

5 To show this, we use the (suboptimal) power allocation scheme that 
allocates no energy for the first (K — 2) slots. Then we deliver all information 
in the last two slots. In the last two slots, full CSI is available, so we can 
achieve bounded NMESE. 
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(ii) Suppose we use the heuristic policy in part (ii) of the 
proof of Theorem [3] From ((23}, the expected sum energy is 
upper bounded by 2/ -1 (i? e ff)<f>2. If $2 is bounded, J/f(^?eff) 
must also be bounded. ■ 

Theorem |4] clarifies that $1 and $2 that depend only on 
the channel distribution are the key parameters for checking 
if the NMESE is bounded. As an application, we assume 
that each link SNR 7 follows the noncentral chi-squared PDF 
with degree of freedom v > and non-centrality parameter 
A > 0. Without loss of generality for determining whether the 
NMESE is bounded, let the mean be E[-y] = v. Thus, the CDF 
of 7 is J24) 

F X 2 (7; v, A) = 1 - Q v/2 (7a, Vt) , 7 > (24) 

where Q M {a,b) = x{x/a) M - 1 e-^ +a2 ^ 2 I M ^ 1 {ax) dx 
is the generalized Marcum Q-function and Im-i{') is the mod- 
ified Bessel function of order M — 1. Note that we can obtain 
the noncentral chi-squared distribution via 7 = 2~2i=i \Xi\ 2 
where X±, ■ ■ ■ ,X S are complex independent Gaussian vari- 
ables (not necessarily of zero mean or of the same variance), 
where s = v/2 is a positive integer J24). We can view this 
as the SNR distribution after maximal-ratio combining (MRC) 
of the received signals from s independent Rayleigh or Rician 
fading channels 11241 . This distribution reduces to the Rayleigh 
fading channel if v = 2 and A — > 0, and to the Rician fading 
channel if v = 2 and A > 0. 

Corollary 2: Suppose that the SNRs for different links 
and different slots are i.i.d. and each follows the CDF 
F X 2 (7; v, A), 7 > 0. Then the following hold for the NMESE: 

(i) Rayleigh fading channel: J™ relay (i? e ff ) — > 00 for K > 1 
and J k (Rett) < 00 for K > 2. 

(ii) Rician fading channel: J£? relay (i? e ff ) — > 00 for K > 1 
and Jff(-Reff) < 00 for K > 2. 

(iii) Noncentral chi-squared channel where v = 2s and 
s_ e {2,3,---}: J™ relay (i?eff) < 00 for K > 1 and 
Jk {Res) < 00 for K > 2. 

Proof: We prove the three parts separately. 

(i) The PDF of the Rayleigh fading channel is given by 
the exponential PDF /exp(t) = e^/ 2 / 2 - This PDF is also 
obtained by the truncated exponential PDF with 7 trun c — > 0. 
From Theorem [3] we have already obtained J™ relay (i? e ff) — > 
00 for K > 1 and Jk{Rss) < 00 for K > 2. 

(ii) First, consider the case of no relaying. The PDF 
of the Rician fading channel is given by ^2(7; 2, A) = 
e -h+A)/ 2 I (y/Xy)/2. Since Iq{-) is an increasing function 
and J o (0) = 1 125], e- A / 2 / EXP (7) < ^2(7; 2, A) for 7 > 0. 
Thus e - A / 2 $f xp < $f\ where $f xp and ®f denote E[l/ 7 ] 
under the exponential and chi-squared PDFs, respectively. 
From part (i), the NMESE is unbounded for Rayleigh fading 
channels. By part (i) of Theorem |4] (the "only if part), 
<f>f xp — > 00, and hence — > 00 too. Using part (i) of 
Theorem g] (the "if part), we conclude that the NMESE is 
unbounded for Rician fading channels. 

Next, we sketch the proof for the case of relaying. We 
associate the SNR of a Rician fading channel with the SNR 
of a Rayleigh fading channel with the same non line-of-sight 
component. Clearly, $2 of the first channel is always lower 



bounded by $2 of the second channel. From part (ii) of 
Theorem [3] (the "only if part), $ 2 of the Rayleigh fading 
channel is always bounded. Thus, <£> 2 of the Rician fading 
channel is also bounded. By part (ii) of Theorem |4] the 
NMESE with relaying is bounded. 

(iii) First, consider the case of no relaying. Suppose A = 0. 
Then the PDF of the SNR is a central chi-squared distribution. 
By direct integration, we obtain E[1/7|A = 0] = 2 (s-i) = 
^central < ^ jh us ^ t h e NMESE is bounded without relaying. 
Now suppose A > 0. From 11261 Theorem 1], the generalized 
Marcum Q-function QM{a,b) is strictly increasing in M and 
a for all a > and M, b > 0. It follows from dUJi that 
Fj{j;v, A) < i 7 ! 7 (7;w,0). We say that the random variable 
with A > (call it 71) first-order stochastically dominates the 
random variable with A = (call it 72). From decision theory 
ll27l . it is known that E[u(7i)] > E[u(7 2 )] for any increasing 
(utility) function u if 71 first-order stochastically dominates 
72. Substituting u{j) = — 1/7 then gives E[l/7i|A > 0] < 
^central 5 mce we jj ave sn0 wn that $= entral < 00, it follows that 
E[l/7i|A > 0] < 00. Finally, with relaying the NMESE must 
also be bounded (by explicitly allocating no power to the relay 
as a suboptimal policy). This completes the proof. ■ 

Part (iii) of Corollary |2] shows that if more than one 
independent channels are present for SNR combining, the 
inherent diversity present is sufficient for the NMESE to be 
bounded, even without relaying. Intuitively, parts (i) and (ii) 
Corollary |2] suggest that Rayleigh and Rician channels do 
not have sufficient diversity and relaying is still necessary. 
Thus, even a strong-line-of-sight signal component (in Rician 
channels) is not sufficient for energy-efficient communications 
and so some form of diversity technique is still useful. 

V. Numerical Results 

Closed-form solutions cannot be easily obtained for K > 3 
in general, as explained in Section ITlI-DI Nevertheless, numer- 
ical computation of the minimum sum energy can be obtained 
using Theorem |2] by discretizing the state space of the causal 
CSI Sfc = (~fk, -Bfc, -Bfc) for all system states. To this end, 
we discretize and to take values in the set B = 
{0 to i? e ff in discrete steps of A}. Next, we employ a Monte 
Carlo approach to compute the expectations in the Bellman 
equations; the computational complexity of this approach is 
small compared to the alternative approach of discretization 
of the link SNRs jk and their probability spaces. That is, 
we obtain N s [ m independent realizations of 71 , • • • , jk with 
distribution (O where each link SNR follows the PDF dT7T >. 
Then we perform a backward recursion of the Bellman's 
equation with k initialized as K. For our numerical results, 
we set A = 0.01 and N s [ m — 5000. Specifically, we perform 
the following steps: 

• Use ( TO! ) to compute Jk,c/, k {sk) for each cf)k £ {1>2}, 
B^, B® € B, and for each realization of -f^. If k = K, 
we can use dl lal ) immediately. 

• Obtain an approximation of the expectation 
E 7fc [Jk„$ k {s)] by averaging over all j k . 

• Repeat the first two steps with k decremented by one. 
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Fig. 5. Normalized minimum expected sum energy to deliver R, e ff nats per 
slot per channel use over K slots. Relaying and 7trunc = 10 — 3 . 
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Fig. 6. Normalized minimum expected sum energy to deliver R c ff nats per 
slot per channel use over K slots. No relaying and 7tmnc = 10 -3 . 



We assume the SNRs are i.i.d. over different links and slots 
according to the truncated exponential PDF with average SNRs 
7 SD = 7 RD = 7 SR — 1; thus, any energy incurred may be 
taken as normalized with respect to the average SNR. We 
use the same SNR truncation threshold 7 trU nc = 1CP 3 . Fig. [5] 
shows the NMESE J K (Rett) defined in (18) for various K. 
The improvement is significant initially when K is increased, 
but less so at larger K. Our system setup may be considered 
a worst-case scenario, since typically R is located somewhere 
between S and D which implies that 7 s0 <C 7 RD and 
7 SD -C 7 SR - Nevertheless, the NMESE is still reduced when 
compared with Fig. [6] where no relay is used. For example at 
i? e ff = 1 nat, the reduction is more than 1 dB and 0.5 dB for 
K = 2 and K = 5, respectively. 

To investigate the effects of approaching a Rayleigh-fading 
channel, we reduce the SNR threshold to 7 trun c = 10 -10 . From 
Fig- 13 we see m at the NMESE is almost unaffected if relaying 
is used, while the NMESE clearly increased if no relaying is 
used. Fig. [7] also highlights the significant progressive drop in 
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grows arbitrarily large as 




K=1 

a - k=2, no relay 
-0 — K=2, relay, heuristic policy 

- K=2, relay, optimal policy 

- K=5, relay, optimal policy 



0.5 



1.5 



Ft per slot (nats) 



Fig. 7. A comparison of the normalized expected sum energy per slot per 
channel use over K slots at different simulation conditions, assuming 7tmnc = 
10 . A heuristic policy that is guaranteed to achieve a bounded expected 
sum energy is included for comparison. 



NMESE as (i) K — lis increased to K = 2 without relay, 
(ii) relay is added for K = 2, and (iii) K = 2 is increased to 
K = 5 with relay. We also include a heuristic power allocation 
policy proposed in the proof of Theorem [3] for K = 2, which 
is shown to achieve bounded sum energy even for 7 trU nc — > 0. 
In this scheme where K = 2, S uses the minimum power to 
transmit in the first slot such that R or D decodes the message; 
if only R decodes, S or R that has the stronger link to D 
transmits such that D can decode the message. From Fig. [7] 
the NMESE of the heuristic policy is close to the optimal 
scheme especially at small i? e ff. To achieve a good complexity- 
performance tradeoff, other heuristic policies for different K 
can also be devised and compared against the optimal schemes 
presented here. 

VI. Conclusion 

In this paper, we consider the problem of energy minimiza- 
tion of a slotted relay system that guarantees data delivery 
within a deadline, assuming the availability of causal CSI. 
We have obtained analytical solutions and provided conditions 
on which the expected sum energy is bounded. Our results 
indicate that if full CSI is available, or if the channel is 
sufficiently rich in diversity, then the expected energy required 
is bounded. However, the expected energy becomes unbounded 
if only causal CSI is available (as is typical in practice) 
and if the channel is not sufficiently rich in diversity, such 
as Rayleigh and Rician channels that are representative of 
wireless channels. To make the expected energy bounded, 
advanced relaying with adaptive power allocation is necessary. 
Thus, we provide an alternative viewpoint of the advantage of 
relaying from the energy-efficient perspective. 

An interesting future direction is on the analysis of the 
advantages of relaying in multi-user and multi-carrier systems 
from the energy-efficient perspective, compared to the case 
where no relaying is used such as in ll28l . As the exact analysis 
appears challenging, further asymptotic analysis is needed 
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to yield insights. The development of heuristic policies with 
provably good performance may also yield further insights for 
practical implementation. 

Appendix A 
Proof of LemmaQ] 

We first prove that there is no loss of optimality if only 
either S or R transmits in each slot k in Phase 2, where 
K + 1 < k < K and K satisfies the constraint d6bl i. 
Consider 7 jp > 7 RD . Suppose the optimal power allocation 
Pfc = [Pfc*'Pfc*] satisfies p R * > and p|* > 0. A necessary 
condition for p£ to be optimal is that it satisfies the constraint 
( |6c] >. By construction, we choose the non-negative powers 

Pfc = (Pk^Pk) where Pk = and pI = pI* + Pk*ik D hl D - 

Since Pfc*7/J D +p R *7j? D = Pfc7fc D +Pfc7fc D ' Pfc satisfies the con- 
straint ( f6cb . i.e., this is a feasible solution to Problem P0. 

Now, we write p\ + Pk = Pk* + Pfc*7fc D /7fc D < Pfc* + Pfc* 
since 7^° > 7 RD , and so p^. achieves a smaller sum energy 
than p£. By contradiction, pj£ is not optimal. Hence, it is 
optimal that only S or R transmits. The case of jf D < 7 RD 
can be proved similarly. Finally, consider 7^° = 7 RD . Any 
non-negative (p R ,p|), such that p R + p| is fixed, does not 
affect the constraint nor the objective function. Hence, no loss 
is incurred if p R * = or p\* = 0. Now, given that only 
one node transmits, we complete the proof of Lemma Q] that 
it is optimal for the stronger node to transmit. The proof 
is by contradiction: suppose only the weaker node with a 
strictly smaller SNR to the destination transmits, then a higher 
power needs to be allocated to satisfy the constraint ( l6cl >. This 
implies a smaller sum energy, i.e., a strictly suboptimal power 
allocation. 



Appendix B 
Proof of TheoremQ] 

We use the Bellman's principle of optimality to solve 
Problem PI recursively, starting from slot k = K until k = 1 
||20l . It is useful to refer to the state diagram in Fig. [3] 
Given causal CSI in slot k, 1 < k < K, let the sub- 
policy consisting of the power allocation of slots k, ■ ■ ■ ,K, 
be 7T fc (s fe ) = {(p fe (sfc),Pfc+i(sfe+i),- •■ ,Pk(sk)) > 0,Vs, G 
Si, i = k + 1, • • • , K : © holds}. 

Consider the last slot K in Phase <fr <G {1,2} given 
causal CSI s. The minimum energy incurred for slot K in 
Problem PI, where the optimizing variable is p_ff(s) = pk, is 



mm pk ■ 

p K (s)67r K (s) 



(25) 



The constraint (l8bl in ttk (s) is active only for the last slot, 
and so is equivalent to B K+l < 0, i.e., the state must transit to 
(K + 1, T). From ©, we get B K < I{pk1 S k) if 4> = L and 
B K < I(pk1k) if <fr = 2- Thus, Jk.4>(s) can be obtained by 
(II 1 at for slot K . Next, consider the next-to-last slot k = K — 1 
in Phase <\> 6 {1,2} given causal CSI s. We denote the next 
phase that the slot will transit into as <f>' and the next causal 
CSI as s'. The minimum expected sum energy for slot K — 1 
and slot K in Problem PI, where the optimizing variables are 



Pif-i(s) =px-i and the set {pk (s'),Vs'}, is then given by 

Jk-i,^>(s) 

Pk-i +1£V,s' \Pk\4>, s] 



mm 

(Pk-i (s),p_R:(s'),Vs')e7rj<_i(s) 



mm p K -\ 

°A--l(s)>0 

min Pk-\ 

°R--i(s)>0 



mm pk 

Pk (s')e-jr K (s') 



V,s' [Jk,<P'(s')\4>, s] . 



In general, let Jk.4>(s) denote the minimum expected sum 
energy for slot k until slot K in Problem PI given phase </> 
and causal CSI s. Repeating the proof similarly for decreasing 
k = K-2, ■ ■ ■ , 1, we obtain ( fTTB for all k g K\K. Finally, if 
(j> = T, the constraint ( |8bl is satisfied and so it is optimal that 
p£ = 0, hence we obtain dl let . By backward recursion until 
k = 1, we obtain Ji i(s), which is the minimum expected sum 
energy for all slots since the system must start in Phase 1 . The 
optimal policy tt* that achieves Ji,i(s) is clearly given by the 
power allocation that solves the backward recursive equations. 

Appendix C 
Derivation of (fT9li - (|22T) 

We obtain ( fl9] l from (11 lat since the first slot equals 
the last slot and relaying is not possible, and we ob- 
tain ( f20b by applying Corollary Q] The derivation for 
(I2TI 1 follows by direct substitution and algebraic manipu- 
lations. The derivation for (l22l follows from the distri- 
bution of 7 = max{7 SD , 7 RD }, which can be expressed 
as = J^(x) = |:Pr( 7 SD <x, 7 RD <x) = 

^:P 7 sD(a;)P 7 RD(a;) = / 7 sd(x)P 7 rd(x) + / 7 Ro(a;)P 7 sD(a;) for 
x > 7tmnc, where Fx(-) denotes the CDF of X. After some 
algebraic manipulations, we get $2 as in 



Appendix D 
An Asymptotic Result 

From (I2TI 1 and (l22l . we can express 

$1 =a(7 SD ,7tmnc) (26) 
$2=6(7 SD ,7 RD ,7trunc) (27) 

for 7 SD ,7 RD > 0, where we denote 

6 {x, e) = exp (e/a:) Ei (e/x) /x (28) 
6 (x, y, e) = 6 (x, e) + (y, e) - & (HM(x, y), e) (29) 

for e > and x > 0, where Ei (x) = exp(— t)/t dt is 
the exponential integral and HM(x,y) = (1/x + l/y)^ 1 is the 
harmonic mean of x and y. 

Lemma 2: For small e, we have the following asymptotic 
results: 



ti(x,e) = -- (V + log f 1 )) + O(elog(ej) 



(30) 



60, y, e) = - log (l + - ] +-Iog(l+^ +0(elog(e)). 
x \ yj y \ xJ 

(31) 

where 7' is the Euler's constant. As 7 trU nc — > 0, we have 
$1 00 (32) 



1 / 7 SD 

*2^^SDl0g^l + ^RD 



7 



lo K (l + 7 

7 



RD 



RD ~ 



SD 



<oo.(33) 
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Proof: We use the asymptotic results that exp(e) = 1 + 
e + C(e 2 ) and Ei (e) = -7' -log e + 0{e) JB] Section 5.1.11] 
for small e, to obtain ([30b and (|3TT l after some algebraic 
manipulations. Since — log(e) — > 00 and elog(e) —> as 
e —> 0, clearly $1 and $2 given in (|26| | and d27l i. respectively, 
approach their respective bounds in Lemma |2] ■ 
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